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•£} ■ Abstract 

o , 

, A theorem of Davies states that for symmetric quantum states 

pLiI there exists a symmetric POVM maximizing the mutual information. 

To apply this theorem the representation of the symmetry group has 
to be irreducible. We obtain a similar yet weaker result for reducible 
^ I representations. We apply our results to the double trines ensemble 

and show numerically that for this ensemble the pretty good measure- 
ment is optimal. 

. 1 Introduction 

One of the basic problems of quantum information theory is the quantum 
detection problem: Given an unknown element of a finite set of possible 
states we want to obtain as much knowledge as possible about this state by 
performing measurements. More precisely, we look for a positive operator- 
valued measure (POVM) that minimizes or maximizes a certain optimality 
criterion. There are different criteria for the detection of quantum states. 
For example, we can consider the detection error probability, Bayes costs PP, 
or the mutual information [21 IHj . In this article we only consider the mutual 
information of a measurement. 
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Compared to other criteria the mutual information leads to very hard 
optimization problems even for simple state sets. This is due to the logarithm 
in the definition of the mutual information whereas other criteria as the error 
probability or Bayes costs are much simpler. There is only little known about 
optimal measurements for the mutual information j2J El IH ISl E] ■ The principal 
idea for obtaining these results is to use the convex structure of the POVMs 
and the mutual information. Standard arguments for convex functions and 
sets 13 El in], e.g., Caratheodory's theorem, can be applied. Davies showed 
with these arguments that we can find optimal measurements with a certain 
number of POVM operators j2] . Furthermore, for symmetric state sets there 
exists an optimal measurement whose POVM operators constitute a single 
orbit. The proof of this theorem only works for irreducible representations 
of the symmetry group and the theorem cannot be generalized directly to 
reducible representations as the example of Refs. U El shows. This means 
that for certain state sets with a reducible representation of the symmetry 
group it is not possible to obtain an optimal POVM which is a single orbit. 

In this article we generalize Davies' theorem to reducible representations. 
The generalization states that there is an optimal symmetric POVM where we 
know an upper bound for the number of orbits. The upper bound depends on 
the number of irreducible components in the representation of the symmetry 
group. We apply the generalization to the double trine ensemble and show 
numerically that the pretty good measurement of Refs. |TU] is an optimal 
measurement for this state set. 

We proceed as follows. In the next two sections we recapitulate basic 
definitions and properties of POVMs, symmetric matrices, and the mutual 
information. In Sec.|3]we show how both of Davies' theorems can be proved 
with a theorem that directly follows from the theory of convex sets. This 
theorem leads to the generalization to reducible representations. In Sec. El 
we apply the generalized theorem to two special cases of the lifted trines. 

2 Symmetric states, POVMs, and matrices 

In this section we outline basic definitions of symmetric quantum states and 
POVMs. We show that the symmetry of POVMs naturally leads to matrices 
with symmetry. 
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2.1 Symmetric states and POVMs 

We consider a quantum system with corresponding Hilbert space C^. A 
state of the system can be described by a density matrix p G C^^'^, i.e., a 
semi-positive matrix with tr(p) = 1. In the following we refer to a state set^ 
S = {pi, . . . , pm} with corresponding prior probabilities p{i) as an ensemble. 
A pure state pi = |\E'j)(\E'j| can be described by the state vector |\E'j). A 
POVM measurement is defined by a set P = {Hi, . . . , n„} C (□'^^'^ of non- 
zero semi-positive matrices with Hi = Id where Id denotes the identity 
matrix of size d x d. The result of a measurement is an index i which occurs 
with the probability tr(njp) when p is the given state. 

The symmetry of ensembles and POVMs is defined by the invariance of 
the corresponding set of matrices under the action of a group. 

Definition 1. Let X = {Xi, . . . ,Xm} ^ (C^^"^ be the set of matrices corre- 
sponding to a POVM or ensemble. Furthermore, let G be a finite group with 
unitary representation a : G ^ C^^*^. The POVM or ensemble is symmetric 
with respect to a if X is invariant under the operation Xi i-^ a{g)XiO'{gy 
for all g E G and i G {!,..., m}, i.e., this operation defines a permuta- 
tion representation on X . For ensembles we additionally assume equal prior 
probabilities for states of the same orbit. 

Following this definition, we assume that a is a non-projective representa- 
tion. As discussed in Ref. [11 , a projective representation can be transformed 
into a non-projective representation by a central extension of G. Further- 
more, we do not assume that G operates transitively on X. This allows that 
we can consider the symmetries that are defined by the subgroups of G, too. 
In particular, the group G can be the trivial group. 

An important construction for POVMs is the symmetrization. This 
means that a POVM can be extended to a symmetric POVM as the fol- 
lowing lemma states j2]. The symmetric POVM can contain several orbits 
and the matrices need not be distinct. 



Lemma 2. Let P C (^<^^<^ he a POVM and a : G ^ (^^dxd ^ unitary repre- 
sentation of the finite group G. Then 



"'^We allow multiple copies of elements in a set of states or POVM operators, i.e., we 
consider multisets. 
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is a symmetric POVM. 



Another construction to obtain new POVMs is the convex combination: 

Definition 3. Let P = {Hi, . . . , n„} and P = {fii, . . . , n„} be two POVMs 
of a system. For A G [0, 1] define the convex combination 

AP + (1 - A)P := {AHi, . . . , An^, (1 - A)ni, . . . , (1 - A)nj. 

This convex combination corresponds to a random selection between two 
POVMs. We do not forget which POVM we have chosen after the measure- 
ment, i.e., we assume that the resuhs of both POVMs are distinct. 

2.2 Matrices with symmetry 

The matrices of a POVM are Hermitian. The d"^ matrices 



Ekk := \k){k\, Xki := + and Y^i := z\k){l\ - i\l){k\, (1) 



k > I, of size {d x d) constitute an orthogonal basis for the real linear space of 
Hermitian matrices with the trace inner product. For symmetric POVMs we 
construct specific matrices in subspaces that can be described by the theory 
of symmetric matrices fn\ . 

Definition 4. Let G be a finite group with representations a : G —>■ C'"^"* 
and T : G ^ (£<nxn^ rpj^^ matrix M G C*"^" has the symmetry (G,a,T) if 
a{g)M = Mr{g) for all g eG. We write aM = Mr. 

Due to Schur's lemma a symmetric matrix has a special structure 
which can be described with the intertwining space ^] of two representa- 
tions. 

Definition 5. Let G, a and t be as in Def. ^ The intertwining space of a 
and T is the linear space Int((T, r) := {M e C™^" : aM = Mr}. 

A matrix M has the symmetry (G, a, r) if and only if M G Int((j, r). 
Hence, the structure of a symmetric matrix is determined by the structure of 
the intertwining space. The latter can be easily described if we assume that 



z 



z 



a = 



^{Irm ® Ki) and r 



(2) 



i=l 



i=l 
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are decompositions of a and r into the irreducible representations of G. 
These decompositions can be obtained by conjugation of a and r with ap- 
propriate unitary matrices fl^. The natural numbers mi and Ui are the 
multiplicities ^3] of the irreducible representations Ki in a and r. The fol- 
lowing lemma specifies the structure of the intertwining space jl5j . 

Lemma 6. Let o and r be two representations of G with the decompositions 
ofEq. (0). Then 

lnt{a, t) = (C™ix"i ® /deg(.o) © • • • © (C'"^""^ © ^dcg(..)) 

where deg(Kj) denotes the degree of Hi. 

For mj = we insert deg(fi;j) zero columns and for ?7,j = we insert 
mi deg(/€j) zero rows at the appropriate positions. For symmetric ensembles 
and POVMs we only need a special case of this lemma. Let Xi be sym- 
metric states or POVM operators. Then C := Xli^j is invariant under the 
conjugation with cr, i.e., 

a{g)Ca{g)^ = C for all g e G. 

This means, that aG = Gcr. Using Lemma El we see that C is a Hermitian 
block-diagonal matrix with blocks that are Hermitian matrices, too. The 
following lemma determines the dimension of the intertwining space. 

Lemma 7. Let a be as in Eq. Then the Hermitian matrices in Int((j, a) 
constitute a linear space of real dimension Yli^l- 

Assume that a is irreducible. Then Int(o", cr) is an one- dimensional space 
since it contains only real scalar multiples of the identity matrix. For a 
representation of the trivial group Int (cr, cr) is the full space of matrices, i.e., 
the linear space has the dimension d^. 

3 Basic properties of mutual information 

Let S be an ensemble and P be a POVM as defined in Sec. 12.11 Using the 
conditional probability p{j\i) '■= tr(njPj) we can define the joint probability 
distribution pij := p{i)p{j\i)- With this distribution we can define the mutual 
information as in classical information theory [16J. 
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Definition 8. The mutual information of the ensemble S and POVM P is 

i=l j=l i=l \j=l J j=l \i=l J 

with H{u) = u log2 u. 

The fundamental problem is to find a POVM P that maximizes I{S, P) 
for a given ensemble S with prior probabilities p{i). The information obtained 
by an optimal measurement is called the accessible information 

We resume some properties of the mutual information which can be used 
to transform optimal measurements into a normal form. Then the optimiza- 
tion can be restricted to these POVMs. The first lemma directly follows 
from classical information theory (see Th. 2.7.4 in Ref. PH]) and essentially 
states that the mutual information is a convex function in the conditional 
probability p{j\i) for a fixed distribution p{i). 

Lemma 9. Let P = {U^, . . . , n„} as well as P = {fli, . . . , n„} be POVMs 
and S an ensemble. Define the POVM Q := {Allj + (1 — A)nj : i = 1, . . . ,n} 
for A G [0, 1]. Then the inequality 

I{S,Q) < XI{S,P) + {1-X)I{S,P) 

holds. The equality holds if and only if 

k k 

for all i and j where pij := p{i)tT(IljPi) . 

The equality condition holds exactly for POVMs with the property that 
the probability vectors {pij, . . . ,Pmj) and {pij, . . . ,Pmj) that are induced by 
Ilj and Ilj for the given ensemble are equal up to a constant factor: 

^jiP^jj ■ ■ ■ iPrnj) — dj{pij, . ■ . ,Pmj) 

for Cj,dj > with cj + dj > for each j. In other words, for the given 
ensemble both operators are indistinguishable up to the constant factor. 

The convex combination of POVMs in Lemma IHl differs from Def. El We 
obtain the latter by padding P from the right and P from the left with zero 
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operators in such a way that all combinations Allj + (1 — A)nj encompass 
one zero operator. Then we have YlikPkj = or J2kPkj = ^^r all j, i.e., we 
have Cj = or dj = 0. Consequently, the information obtained by the convex 
combination of two POVMs is the convex combination of the corresponding 
informations: 

Lemma 10. Let S be an ensemble and let P as well as P be POVMs. Then 
for all X e [0, 1] the equality 

I{S, AP + (1 - A)P) = XI{S, P) + (1 - X)I{S, P) 

holds. 

This lemma has a simple interpretation: For measurements we randomly 
choose between two devices. Then the total information we obtain is the 
weighted average of the informations for each device. 

The next lemma (see Lemma 2 of Ref. j2j) shows that POVM operators 
that are equal up to normalization can be merged without changing the 
mutual information of the POVM. The same is true if we split an operator 
Hi into An, and (1 — A)!!^ for < A < 1. This theorem can be applied 
repeatedly and to permutations of the operators, too. 

Lemma 11. Let S be an ensemble. Then I{S,P) = I{S,P) holds for the 
POVMs P = {Hi, n2, . . . , Un} and P = {AHi, (1 - A)ni, Hs, Hg, . . . , n„} 
with < A < 1. 

The optimization of POVMs can be simplified in some cases if we use a 
special normalization of the POVM operators. The following definition shows 
how a POVM can be rewritten in such a way that the resolution Yli = 
of the identity is a convex combination [2|. 

Definition 12. Let P = {IIi, . . . , n„} be a POVM with non-zero operators. 
Then write P = {Xill[, . . . , A„n^} with 

n := — — -n,- and A,- :- ^ ' 



tr(n,) d 

The identity is the convex combination Ajll^ = Id. 

The last lemma of this section is a generalization of Lemma 5 in Ref. P] 
and can be applied to symmetric ensembles. It states that for a given POVM 
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the symmetrization of this POVM has the same mutual information. Hence, 
we know that for symmetric ensembles there always exists an optimal sym- 
metric POVM. In the next section we prove the existence of a symmetric 
POVM where we know an upper bound for the number of orbits. 

Lemma 13. Let S be a symmetric ensemble with symmetry group G and let 



To apply this theorem the symmetry group need not operate transitively 
on the ensemble. The probabilities p{i) have to be constant on each orbit. 

4 Optimal POVMs for symmetric ensembles 

In the literature, the main tools for the optimization of POVMs are Davies' 
theorems j2] and their real versions • We briefly recapitulate the proofs and 
generalize the theorem for symmetric ensembles to reducible representations 
of the symmetry group. 

Davies' first theorem (Th. 3 of Ref. [2]) states that for an ensemble of 
a (i-dimensional Hilbert space there exists an optimal POVM with n rank- 
one operators where d < n < d?. Davies' proof is essentially based on the 
following lemma which deals with convex combinations of the identity. The 
set of these combinations is convex and the lemma gives an upper bound for 
the number of operators of an extreme point P] . We prove this lemma in the 
appendix with standard arguments of linear optimization. 

Lemma 14. Let Ajll'j = Id be a convex combination with tr(n^) = d. 
Furthermore, let all 11'^ be elements of the affine space Id + K where K is 
an r-dimensional linear subspace of Hermitian matrices. Then the convex 
combination can be rewritten as 



where fii, Vij > and fij = = 1 for all i. Furthermore, for each i 

at most r + 1 elements Vij are non-zero. 

Using this lemma we can prove the upper bound of Davies' first theorem 
as follows. Assume that Hi, . . . , n„ is an optimal POVM. We can assume 
that it consists of rank-one operators P]. Using the normalization of Def. ^1 



P bea POVM. Then I{S,P) = I{S,P^). 
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we have a convex combination J2i '^i^'i = Id- With Lemma HH the POVM is 
a convex combination of POVMs with at most operators each because K 
has dimension^ (P — 1. Lemmas ITUl and ITT] show that at least one of these 
POVMs is optimal, too. 

We show how Davies' second theorem (Th. 4 of Ref. [2]) follows from 
Lemma IT^ The former states that for a symmetric ensemble with irreducible 
representation a there exists an optimal POVM which is a single orbit. Let 
P be an optimal POVM with rank-one operators. Following Lemma IT^ the 
POVM P'^ is optimal, too. We consider the orbits 



Since P^ is a POVM the equation "^-XiDi = Id holds, i.e., the identity 
matrix is a convex combination of the matrices Di. We use the irreducibility 
of a and obtain due to aDi = D^a the equation Di = I^. In other words, the 
matrices Di are elements of the intertwining space Int((T, a) = CJ^. Since 
the matrices Di have trace d they are elements of the affine space Id + {0} 
whose real dimension is r = 0. Following Lemma El there exists a convex 
combination of Id with a single Di. With the same arguments as for the proof 
of Davies' first theorem a single orbit is sufficient for an optimal measurement. 

It is clear how this proof of Davies' second theorem is modified for re- 
ducible representations: The matrices Di are elements of the intertwining 
space Int(o", cr) which has dimension r := 'Y^iTof as stated in Lemma [7| The 
trace normalization reduces the dimension by one. Then Lemma states 
that we need at most r orbits to construct the identity matrix.^ The preced- 
ing discussion shows the following theorem. 

■^The trace normalization reduces the dimension (P of the space of Hermitian matrices 
by one. 

•^A consequence of the decomposition of the POVM is that some operators are de- 
composed into several copies. Lemma states that this does not change the mutual 
information. 




of the operators of P. We have the convex combination AjCj = P'^ with 
Xi := tT(Jli)/d as in Def. IT^ We consider the orbit sums 



g&G 
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Theorem 15. Let S be a symmetric ensemble with a as defined in Eq. 
Then there exists an optimal measurement with rank-one operators which is 
the union of at most mf orbits. 

The theorem can also be apphed if we restrict the symmetry to sub- 
groups of the symmetry group since the action of the group must not be 
transitive. However, by this reduction the bound on the number of orbits 
becomes weaker since the number of different irreducible representations de- 
creases while the multiplicities increase. Therefore, for the solution of 
optimization problems it is beneficial to take as much symmetry as possible. 
As an extreme case, this theorem can be applied to a representation of the 
trivial group.'* Then we have mi = d for the only irreducible representation 
g ^ (1) leading to the upper bound d"^. This discussion shows that Davies' 
first theorem can be obtained as special case of the generalized theorem. 

Both theorems have real versions [S]. The bound of the first theorem can 
be tightened io n < d{d + l)/2 since we can transform an optimal POVM 
into a POVM with real operators. Hence, the subspace K of Lemma IT^ does 
not contain linear combinations of the elements Ym of Eq. (0). Additionally, 
the discussion for the second theorem is also valid if we replace the Ki of 
Eq. (j2I) with the real irreducible representations. We obtain the upper bound 
Y^^mi{mi + l)/2 where the rrii are the multiplicities of the real irreducible 
representations. 

5 Examples 

We apply the real version of Th. to the following ensembles in order to 
obtain optimal POVMs: an ensemble of slightly lifted trines and the double 
trines. The theorem leads to an optimization problem that is a special case of 
those in Refs. [HE]. We identify optimal POVMs and discuss their properties. 
For the slightly lifted trines we conclude as in Refs. |5 that a symmetric 
optimal POVM must at least comprise two orbits. For the double trines we 
obtain an optimal POVM consisting of a single orbit. 

^ Since each orbit under this symmetry comprises a single state the prior probabihties 
of the states can be chosen arbitrarily. 
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5.1 Lifted trines 



For each a G [0, 1] the three vectors 

v^r^ , -\^fY^ , and -iv^r^ 

constitute a hfted trines ensemble. These ensembles are interesting since for 
shghtly hfted trines, i.e., a is next to zero, it can be numericaUy shown that 
two orbits are necessary to obtain an optimal POVM [HE]. This shows that 
the direct generalization of Davies' theorem to reducible representations is 
not possible and that the bound of Th. can be attained. 

We follow Refs. E] and show in detail the analysis of optimal measure- 
ments for a special case of the lifted trines. The symmetry group of the lifted 
trines is generated by the rotation 

/ 2 \ 
i?:=- -1 

Ho -V3 -1 J 

about 120 degrees. This representation of the symmetry group contains two 
inequivalent real irreducible representations. Each irreducible representation 
has the multiplicity one. Hence, using the real version of Th. we need 
at most two orbits Ci = {U, RliR^, R^IiR} and C2 = {ft, RUR^, R^IIR} 
with operators 11 and 11 of rank one to obtain an optimal POVM. With the 
normalization tr(n) = tr(n) = 1 the convex combination P = ACi + (l — A)C2 
is a POVM for appropriate 11, 11, and A G [0, 1]. We apply Lemma ITUl and 
obtain^ the information I{S, P) = XI{S, Ci) + {l — X)I{S, C2), i.e., the mutual 
information of a convex combination of orbits is the convex combination 
of the formal mutual informations I{S,Ci) and /(S", C2). For an operator 
n = |\E'(a, 6)) (^'(a, 6) I we use the parameterization 

(cos(a) \ 
sin(a)cos(6) (4) 
sin(a)sin(6) J 

^Lemma 10 can be applied to orbits, too. However, we must replace ^^Pij by the 
prior probability p(i) in Eq. Q since '^jPij = p{i) need not hold for a single orbit. For 
an orbit which is a POVM both definitions coincide. 



11 



leading to the orbit sum 



i=0 



2 




3 
2 





I cos^(a) 




3 
2 






I cos^(a) 



) 



For two orbits Ci and C2 with parameters (a, b) and (c, d) the convex com- 
bination \Ci + (1 — A)C2 is a POVM, i.e., the sum of all operators equals J3, 
if and only if 



If we assume cos^(a) < cos^(c) this means that 1/3 G [cos^(a), cos^(c)], i.e., 
cos^(a) G [0,1/3] and cos^(c) G [1/3,1] are all possible values. In the fol- 
lowing we only consider the mutual information /(a, b) of the orbit with 
a = aTCCOS^/x, x G [0, 1], and b G [0, 27r/3]. This is sufficient since for a given 
value cos(a) with cos^(a) = a; we have the four possible values cos(a) = ±v^ 
and sin (a) — x in the vector |\E'(a,6)) of Eq. We denote these 

combinations of signs by ++, H — , — h, and . The case leads to 

the same information as ++ since the corresponding vectors differ only by 
a global phase. With the same argument the cases — h and H — lead to the 
same mutual information. For H — the vector has a minus sign in the last 
two components. Hence, we have the same information as for ++ where 
we replace 6 by 6 + vr. This discussion shows that the optimization with 
(arccosi/x, 6) for x G [0,1] and b G [0,27r/3] takes all possible values into 
account.® 

We restrict our attention to a = 1/20, i.e., to an example of slightly 
lifted trines. In Figs. Q and |21 the information /(arccosy^, b) of an orbit with 
parameters (arccosy^, b) is shown. Due to the symmetry each probability 
Pij is equal to the probability piu for a certain k. Hence, the information of 
the orbit is 

/(arccosVa?, b) = 3{H{pn) + if (^12) + H^pis) - H{pn + pu + pis)) + log23. 

The condition of Eq. means that the points (a, b) and (c, d) we choose 
for our orbits lie on different sides of the plane x = 1/3. From the figures it 
follows that a single orbit cannot be optimal since a POVM with a single orbit 
corresponds to a point on this plane. More precisely, the optimal information 

^The values of b can be restricted due to the symmetry. 



A cos^(a) + (1 — A) cos^(c) 



1 



(5) 



3 
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Figure 1: The mutual information I {a.Tccos^/x , b) for the hfted trines with 
a = 1/20. The information of an orbit can be negative. However, as Fig. |21 
shows the convex combination of the information of two points on different 
sides of the plane x = 1/3 leads to a non- negative information. 




Figure 2: Fig. ^viewed along the 6-axis shows the maximum of I{aIccos^/x, h) 
for each x whereas h remains a variable. This maximum is slightly convex in 
the interval [0, 0.3831] of the x-axis. 
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Figure 3: The function J(z/, b) has a global maximum at 6 ~ 0.1377. It seems 
to be complicated to give a simple analytic definition of this value. The 
partially negative function is —{d/ dh)I{v, h)/2. The remaining values can be 
obtained by virtue of the symmetry of the function. 

we can obtain for points on this plane is 0.8456 bit as shown in Fig. El This 
can be obtained for the POVM with (i/, b) where v := arccosA/l/3 and 
h ~ 0.1377. The slight convexity in Fig. El as well as Refs. jUE] suggest that 
we can obtain more information with two points: a point with x = and 
a point with 6 = on the other side of the plane. Numerical computations 
show that an optimal point for x = is (7r/2,7r/2) with the information 
0.15996 bit. The other optimal point can be chosen to be (arccos a/0.3831, 0) 
with 0.9499 bit. The convex combination of both informations is 0.8472 bit. 
This is more than the information of the optimal single orbit. 

In the following we show that the accessible information cannot be ob- 
tained with a POVM which is a single orbit even if we consider operators 
of higher rank. Consequently, the characterization [T71 ITH] of the extreme 
points of the convex set of POVMs consisting of a single orbit cannot be 
applied. Assume that P := {R'HR-' : i = 0,1,2} is an optimal POVM 
with initial operator 11 = X]i=i where (\E'j|\E'i) = 1, = ^l^^ 

Si > 0. Then Lemma 2 of Ref. |2I and Lemma ITUl state that 
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with Pi := {W\-^i){<I/i\R-^ : j = 0,1,2}. The POVM ^.i^iPi consists of 
three orbits. Using Th. Elwe construct an optimal POVM with two of these 
three orbits. Without loss of generality we assume that the orbits correspond 
to |\E'i)(\E'i| and |\E'2) (\1'2|- The two corresponding points (a, 6) must be the 
optimal points^ given above. The probability vectors (0.2375, 0, 0.2375) for 
and (0.2724,0.0199,0.0199) for |^2)(^2| are not equal up to a con- 
stant factor. Therefore, following Lemma IHl inequality (jH)) is strict, i.e., the 
single orbit cannot be an optimal POVM. 



5.2 Double trines 



The double trines CD] are defined by the three state vectors 
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of two qubits^. We apply the unitary basis transform 
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and obtain the state vectors 
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^The point (7r/2,7r/6) leads to the same results as (7r/2,7r/2). 

^Compared to the symmetry of the lifted trines in Sec. l5.ll we have the additional sym- 
metry operation that interchanges the qubits. Even with this operation the representation 
of the symmetry group is reducible. We do not consider this symmetry operation in the 
following since the decomposition of the representation does not become simpler. 
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Figure 4: The mutual information /(arccosy^, b) for the double trines en- 
semble. The horizontal plane corresponds to the information 1.369 bit which 
is the information of the pretty good measurement. 



We omit the last component^ and obtain the lifted trines with a = 1/2. In 
contrast to the previous section these trines are strongly lifted. As mentioned 
in Refs. this leads to different properties of optimal POVMs. We replace 
a = 1/20 by a = 1/2 in the computations of Sec. 15.11 and obtain Figs. 0] 
and El where the information /(arccosi/x, b) is shown. An optimal POVM 
can be obtained with a convex combination of at most two orbits. It follows 
from Fig. that a single orbit with x = 1/3 is optimal since the convex 
combination of the information of two points on different sides of the plane 
X = 1/3 is strictly below the maximum of the information on this plane. 
Computations show that /(i/, b) with u = arccos-^1/3 has an extreme point 
at 6 = 0. Hence, an optimal point on this plane is (a, b) = {u, 0) leading to 
the information 

, , 2v^7 - 9 ln(2) 
^<'--°'= 6H2) 



^An optimal POVM operating on the four dimensions can be projected to a POVM on 
the three dimensions. This projection does not change the mutual information. 
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1 0.8 0.6 0.4 0.2 

X 

Figure 5: Fig. E] viewed along the 6-axis shows the maximum of /(arccosi/x, b) 
for each x whereas b remains a variable. 

with 7 = In (2(3 + 2-\/2)^) • This is equal to the information obtained in 
Refs. jniE] for the pretty good measurement [inj. Furthermore, the Hessian 






6-(2+V2)7 
31n(2) 



-7.221 






-4.041 



of /(arccosv^, &) is negative definite at the point (x, 6) = (1/3,0), i.e., the 
information is concave in this region. These arguments and the global prop- 
erties of I {aIccos^/x , b) which can be seen in Figs. |3] and El suggest that the 
POVM corresponding to the point {u, 0) is optimal. 



6 Conclusions 

We have generalized Davies' theorem for symmetric ensembles to reducible 
representations of the symmetry group. There always exists an optimal 
POVM consisting of at most Yli '^i orbits where the rrii are the multiplicities 
of the irreducible components in the representation of the symmetry group. 



17 



Acknowledgments 



The author acknowledges helpful discussions with D. Janzing. This work was 
supported by Landesstiftung Baden-Wiirttemberg gGmbH (AZ 1.1322.01). 

Appendix 

In this appendix we prove Lemma El with standard arguments of linear 
optimization. Consider a POVM on a rf-dimensional system with operators 
Hi, . . . , n„ where we write Ilj = Ajll'^ as in Def. ^1 We have the convex 
combination Ajll^ = J^. With respect to the basis of Eq. ((T)) this convex 
combination can be written as equation 

... 1\ 
Y J 

Here we write A = (Ai, . . . , A„)"^. The 1 on the right side of the equation is the 
all-one vector of length d + 1 and the is the all- zero vector of length d"^ — d. 
The matrices E G R"^^" and X, F G R(('i'-rf)/2)xn contain the coefficients of 
the real linear combinations 

d-l 

fc=0 k>l k>l 

of Ilj = (n^')fci where 3fJ and denote the real and imaginary part of a 
complex number. More precisely, the i-th column of E, X, and Y contains 
the coefficients U'^'^, 3?(nf ), and Q=(nf ), respectively. 

We discuss some elementary properties of the solutions Ce of EX = b 
with Aj > and the vector b consisting of d ones. The matrix E contains 
only non-zero columns and non-negative entries. 

Lemma 16. The set Ce is convex and compact. 

Proof. The set Ce is closed and convex. Assume that it is unbounded. Then 
following Th. 2.5.1 of Ref. it contains a ray, i.e., the set {p + fiq : fJ^ > 0} 
with q 0. Choose k with qk ^ 0. Since we have Aj > the vector q cannot 
contain negative entries. For all /i > we have E{p + fiq) = Ep + ^Eq = b. 



( 1 

I win prp n • = 





-DA = \ \ \ where D : 
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Hence, we have Ep = b and Eq = 0. The inequahty Eq > E^q^ holds where 
E}. denotes the /c-th column of E and > the element-wise relation. Since E 
does not contain a zero column we have Eq > Ekqk > in contradiction to 
Eq = 0. Hence, the set Ce cannot contain a ray. □ 

This lemma can also be applied to Cd since the additional equations 
restrict the set of solutions even more. Hence, Cd is convex and compact. 

Lemma 17. A compact convex subset o/R" is the convex hull of its extreme 
points. 

Proof. See Th. 2.4.5 of Ref. [Zj. □ 

Lemma 18. The set Cn has only a finite number of extreme points. For an 
extreme point A = (Ai, . . . , A„)^ of Cd we have at most rank(D) non-zero 
elements Aj. 

Proof. Following Th. 2.3 in Ref. jH] an extreme point of Cd corresponds to a 
feasible basic solution of -DA = c where c is the vector of Eq. (0) consisting 
of ones and zeros. Since the equation Yli^i^'i = shows that £z) is non- 
empty we can remove linear dependent rows of D without changing the set of 
solutions (see Th. 2.5 of Ref. |gj). Then Th. 2.4 of Ref. |9| states that a basic 
solution A has at most rank(Z)) non-zero entries. The number of extreme 
points is finite due to Corollary 2.1 of Ref. [0]. □ 

In the next lemma we show that rank(D) is bounded by the dimension 
of the space that contains all operators H'-. 

Lemma 19. Let H'j^, . . . , H^ G C"'^'^ with tr(H^) = d be elements of the affine 
space Id + K where K is a r -dimensional linear space of Hermitian matrices. 
Then the matrix D defined in Eq. ^ has at most rank r + 1. 

Proof. The matrix D without the first row has at most rank r + 1 since an 
affine space of dimension r is contained in a linear space of dimension r + 1. 
The first row does not increase the rank since it is linear dependent to the 
rows of E. This is due to the normalization tr(H^) = d which means that the 
sum of each column of E is d. □ 

With the lemmas of this appendix we prove Lemma El 
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Proof of Lemmam\ As in Eq. ([7j) we write J^i'^i^i = as D\ = c with 
the vector c of Eq. consisting of ones and zeros. Following Lemma El the 
matrix D has at most rank r + 1. Then Lemma ITHl states that an extreme 
point A of Cd has at most r + 1 non-zero elements. With Lemma ITBl we know 
that the solutions of DX = c constitute a convex and compact set which is 
the convex combination of its extreme points as stated in Lemma El CH 
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